Counterexamples to the maximal p-norm 
multiplicativity conjecture for all p > 1 

Patrick Hay den 1 '0 and Andreas Winter 2 ' 3 '0 

1 School of Computer Science, McGill University, Montreal, Quebec, H3A 2A7, Canada 
2 Department of Mathematics, University of Bristol, University Walk, Bristol BS8 1TW, U. K. 
3 Centre for Quantum Technologies, National University of Singapore, 2 Science Drive 3, Singapore 117542 

(Dated: 30 July 2008) 

For all p > 1, we demonstrate the existence of quantum channels with non-multiplicative 
maximal output p-norms. Equivalently, for all p > 1, the minimum output Renyi entropy 
of order p of a quantum channel is not additive. The violations found are large; in all 
\ cases, the minimum output Renyi entropy of order p for a product channel need not be 

significantly greater than the minimum output entropy of its individual factors. Since p = 
1 corresponds to the von Neumann entropy these counterexamples demonstrate that if 
the additivity conjecture of quantum information theory is true, it cannot be proved as a 
consequence of any channel-independent guarantee of maximal p-norm multiplicativity. 
We also show that a class of channels previously studied in the context of approximate 
encryption lead to counterexamples for all p > 2. 
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I. INTRODUCTION 

P 

The oldest problem of quantum information theory is arguably to determine the capacity of a 
quantum-mechanical communications channel for carrying information, specifically classical bits 
of information. (Until the 1990's it would have been unnecessary to add that additional qualifi- 
cation, but today the field is equally concerned with other forms of information like qubits and 
ebits that are fundamentally quantum-mechanical.) The classical capacity problem long predates 
the invention of quantum source coding fl|, |2fl and was of concern to the founders of information 
theory themselvesO]. The first major result on the problem came with the resolution of a conjec- 
ture of Gordon's |4] by Alexander Holevo in 1973, when he published the first proof fa] that the 
maximum amount of information that can be extracted from an ensemble of states a; occurring 
with probabilities pi is bounded above by 

H; x{{Pi,<ri}) = H^^piCT,^ - ^2piH(ai), (1) 

. . i i i 

where H(a) = — Tr a In a is the von Neumann entropy of the density operator a. For a quantum 
channel M, one can then define the Holevo capacity 

X(AT) = max {p . )P . } x({lH,M'(jH)}), (2) 

where the maximization is over all ensembles of input states. Writing C(M) for the classical 
capacity of the channel N , this leads easily to an upper bound of 

COT) < lim -x(AO. (3) 

n^oo n 
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It then took more than two decades for further substantial progress to be made on the problem, but 
in 1996, building on recent advances (g], Holevo |@] and Schumacher- Westmoreland [8] managed 
to show that the upper bound in Eq. (0 is actually achieved. This was a resolution of sorts to the 
capacity problem, but the limit in the equation makes it in practice extremely difficult to evaluate. 
If the codewords used for data transmission are restricted such that they are not entangled across 
multiple uses of the channel, however, the resulting product state capacity Ci^M) has the simpler 
expression 

CWAO = X (A0. (4) 
The additivity conjecture for the Holevo capacity asserts that for all channels Mi and M 2 , 

x(Mi®M 2 ) = x (Mi) + x (M 2 ). (5) 

This would imply, in particular, that Ci^M) = C(N), or that entangled codewords do not in- 
crease the classical capacity of a quantum channel. 

In 2003, Peter Shor [9], building on several previously established connections El Ei, 



demonstrated that the additivity of the Holevo capacity, the additivity of the entanglement of 



formation )13l, 114, 1151, 116H and the superadditivity of the entanglement of formation [17] are all 
equivalent to another conjecture of Shor's which is particularly simple to express mathematically, 
known as the minimum output entropy conjecture [18[]. For a channel M, define 

H m ' m {N) = mmH(N(ip)), (6) 

\<p) 

where the minimization is over all pure input states | ip) . The minimum output entropy conjecture 
asserts that for all channels Mi and A2, 

H min {Mi ® M 2 ) = H min (Mi ) + H min (M 2 ) . (7) 

There has been a great deal of previous work on these conjectures, particularly inconclusive 
numerical searches for counterexamples, necessarily in low dimension, at Caltech, IBM, in Braun- 
schweig (IMaPh) and Tokyo (ERATO) [19], as well as proofs of many special cases. For example, 
the minimum ou tpu t entropy conjecture has been shown to hold if one of the channels is the 
identity channel [20, 21], a unital qubit channel [22], a generalized depolarizing channel |H, 24] 



or an entanglement-breaking channel ]. In addition, the weak additivity conjecture was 



confirmed for generalized dephasing channels [28], the conjugates of all these channels [29] and 
some other special classes of channels Jl6, 3(J 3ll, |32|]. Further evidence for qubit channels was 
supplied in [18]. This list is by no means exhaustive. The reader is directed to Holevo's reviews 



for a detailed account of the history of the additivity problem |33l,l34l] . 

For the past several years, the most commonly used strategy for proving these partial results 
has been to demonstrate the multiplicativity of maximal p-norms of quantum channels for p ap- 



proaching 1 [20j]. For a quantum channel M and p > 1, define the maximal p-norm of M to be 

v p {M) = sup {||Af(p)|| p ;p>0, Trp= l}. (8) 



In the equation, ||<r|| p = ( Tr |cr| p ) 1 ^ p . The maximal p-norm multiplicativity conjecture l|2oTl asserts that 
for all quantum channels Mi and M 2 , 

u p (Mi ® M 2 ) = v p (Mi)vp(M 2 ). (9) 
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This can be re-expressed in an equivalent form more convenient to us using Renyi entropies. 
Define the Renyi entropy of order p to be 

H p (p) = -^—lnTrpP (10) 
1 — p 

for p > 0, p 7^ 1. Since lim p ji H p (p) = H(p), we will also define Hi(p) to be H(p). All these 
entropies have the property that they are for pure states and achieve their maximum value of 
the logarithm of the dimension on maximally mixed states. Define the minimum output Renyi 
entropy H™ in by substituting H p for H in Eq. ©. Since F™ in (A0 = ^ In v p (N), Eq. <© can then 
be written equivalently as 

#7 n (AAi ® N 2 ) = Hf*{Ni) + H p ain (M 2 ), (11) 

in which form it is clear that the maximal p-norm multiplicativity conjecture is a natural strength- 
ening of the original minimum output entropy conjecture (0. 

This conjecture spawned a significant literature of its own which we will not attempt to sum- 



marize. Holevo's reviews are a gain an excellent source ||33l, 13411 . Some more recent important 
references include Unlike the von Neumann entropy case, however, some 

counterexamples had already been found prior to this paper. Namely, Werner and Holevo found 
a counterexample to Eq. ((TTj) for p > 4.79 JI4in that nonetheless doesn't violate the p-norm multi- 



plicativity conjecture for 1 < p < 2 I|42|,|43, 44]. 



Moreover, in 2007, Winter showed that a class of channels that had previously been studied 
in the context of approximate encryption provide counterexamples to the conjecture for all p > 
2 JH]. In light of these developments, the standing conjecture was that the maximal p-norm 
multiplicativity held for 1 < p < 2, corresponding to the region in which the map X h-> X p is 
operator convex [35]. More conservatively, it was conjectured to hold at least in an open interval 
(1, 1 + e), which would be sufficient to imply the minimum output entropy conjecture. On the 
contrary, shortly after Winter's discovery, Hayden showed that the conjecture is false for all 1 < 
p<2|S3]. 

The current paper merges and slightly strengthens JisT] and |0]. We begin in Section HH by 
presenting Winter's counterexamples from [45], which share some important features with |4a] 
but are simpler to analyze. Section [Til] then presents Hayden's counterexamples from [46] with an 
improved analysis showing that they work for all p > 1, not just 1 < p < 2. 

In particular, given p > 1, we show that there exist channels M\ and M 2 with output dimension 
d such that both (J\f\ ) an&Hf n {N 2 ) are equal to In d-0(l) but H™ n (Ni ®N 2 ) = \nd+0(l), 
so 

Hf*(Ni) + ^ p min (AA 2 ) - F^ in (M <g> M 2 ) = In d - Oil). (12) 

Thus, one finds that the minimum output entropy of the product channel need not be significantly 
larger than the minimum output entropy of the individual factors. Since [20, 24] 

Hf n (Ni ® M 2 ) > Hf n {Mi) = In d - 0(1), (13) 

these counterexamples are essentially the strongest possible for all p > 1, up to a constant additive 
term. (Note that the dependence of il? 1111 on p is absorbed here in the asymptotic notation.) 

At p = 1 itself, however, we see no evidence of a violation of the additivity conjecture for the 
channels we study. Thus, the conjecture stands and it is still an open question whether entangled 
codewords can increase the classical capacity of a quantum channel. 
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Notation: If A and B are finite dimensional Hilbert spaces, we write AB = A ® B for their 
tensor product and \A\ for dim A. The Hilbert spaces on which linear operators act will be denoted 
by a superscript. For instance, we write ip for a density operator on AB. Partial traces will be 
abbreviated by omitting superscripts, such as <p A = Ti b ^P AB - We use a similar notation for pure 
states, e.g. \i(j) ab £ AB, while abbreviating ip AB = \tp)(ip\ AB . We associate to any two isomorphic 
Hilbert spaces A ~ A' a unique maximally entangled state which we denote |<£) Aj4 . Given any 
orthonormal basis {(i)" 4 } for A, if we define \i) A = V\i) A where V is the associated isomorphism, 

we can write this state as l^)" 4 " 4 ' = |^4| -1 / 2 Yl\=i N)" 4 !*)" 4 '- W e wm a l so make use of the asymptotic 
notation f(n) = 0(g{n)) if there exists C > such that for sufficiently large n, |/(n)| < Cg{n). 
f(n) = Q(g(n)) is defined similarly but with the reverse inequality |/(n)| > Cg{n). Finally, 
f(n) = e(g(n)) if f(n) = 0{g{n)) and f(n) = Sl(g(n)). 



II. RANDOM UNITARY CHANNELS: p > 2 



This class of counterexamples, while only working for p > 2, has the advantage of being a 
straightforward application of well-known results. Later in the paper we will present stronger 
counterexamples that reuse the same basic strategy, albeit with some additional technical compli- 
cations. A random unitary channel is a map of the form 



n 



(14) 



i=l 



with the V{ unitary transformations of an underlying (finite dimensional) Hilbert space. Let d be 
the dimension of this space. Following [47|], we call J\f e-randomizing if for all p, 



M(p) 



-I 



d 



< 



d' 



(15) 



In that paper, it was shown that for < e < 1, e-randomizing channels exist in all dimensions 
d > — , with n = ^-dln d. In fact, randomly picking the Vi from the Haar measure on the unitary 
group will, with high probability, yield such a channel. 

Recently, it was shown by Aubrun ||4^1 that n can in fact be taken to be 0(d/e 2 ) for Haar dis- 
tributed Vi, and 0(d(ln d) 4 /e 2 ) for Vi drawn from any ensemble of exactly randomizing unitaries. 

Lemma II.l For a random unitary channel N and its complex conjugate, AT : p \-* ^ Yl VipVi , one has 

Proof We use the maximally entangled state |3>) = d^ 1 / 2 as test state, abbreviating 

$ = |$)($|: 

v p {N®J[) > 1 1 (A/ - ® AT)$|| 



l n 
n . . 



1 

n 
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where in the third line we have invoked the U (8> J7-invariance of <3? for the n terms when i = j. 
For the final inequality, observe that the largest eigenvalue Ai of (Af (8> AT)$ is at least ^. Denoting 

the other eigenvalues X a , \\(M ® AT)&\\ P = CE Q > and we are done. □ 



Lemma II.2 If the channel M is e-randomizing, then when p > 1, 



l + e xi-i/p 




Proof Clearly, M and 77 have the same maximum output p-norm. For the former, observe that 
the e-randomizing condition implies that for an arbitrary input state p, \\Af(p)\\oo < -jp. In other 
words, all the eigenvalues X a of the output state N{p) are bounded between and ^p. In addi- 
tion, because Af(p) is a density operator, the eigenvalues sum to 1. 

Subject to these constraints, however, the convexity of the function x h-> x p ensures that the p- 
norm ||AA(/9)|| p = (Yl a ^a) 1 ^ is maximized when the largest eigenvalue is ^jp and it occurs with 
multiplicity Ltt^J' anc ^ a ^ Du * possibly one remaining eigenvalue is 0. Thus, 

ii^(p)Hp = i>;>si <( 7^(^-1 ) =( i ^ 1 )" 1/P - as) 

□ 

Theorem II.3 Fix any < e < 1 and a family of e-randomizing maps M as in Eq. (fT?l > with n > 
134 d In d/e 2 . Then, for any p > 2 and sufficiently large d, 

v P (M)u p {M) < <-<v p (M®IT), (17) 

In other words, for this family of channels, the maximum output p-norm is strictly supermultiplicative for 
sufficiently large d when p > 2. 

Proof Follows from Lemmas III. 11 and 111.21 since 2 — 2/p > 1. □ 

These counterexamples to the multiplicativity of the output p-norm for p > 2 are interesting 
in that they are random unitary channels, which are among the simplest truly quantum maps. 
In fact, the first proofs of multiplicativity for unital qubit channels [22] and depolarizing chan- 
nels [24] exploited this type of structure. Indeed, unital qubit channels are always random unitary 
channels (with d = 2) [18]. Despite the fact that King showed multiplicativity for such channels 
at all p > 1 J22I], there is no conflict with the result here, as the bound on n becomes better than d 2 
only for rather large dimension d. 

We observe, furthermore, that p = 2 is indeed the limit of validity of this class of counterexam- 
ples, since n > d for any e-randomizing map. 



III. GENERIC QUANTUM CHANNELS: ALL p > 1 



Let E, F and G be finite dimensional quantum systems, then define R = E, S = FG, A = EF 
and B = G, so that RS = AB = EFG. Our second and stronger class of counterexamples will be 
channels from S to A of the form 

N(p) = Tr B [U(\m R ® P)rf] (18) 



for U unitary and |0) some fixed state on R. Another, slightly more flexible way of writing this is in 
the language of isometric Stinespring dilations: namely the Hilbert space isometry V ■ S <— > AB 
defined by V\tp) = U[\0) R \ip) s ). In this notation, to which we will adhere from now, M(p) = 
Ti B VpVl 

Our method will be to fix the dimensions of the systems involved, select U (i.e., the isometry 
V) at random, and show that the resulting channel is likely to violate additivity. The rough in- 
tuition motivating our examples is the same as in the previous section: we will exploit the fact 
that there are channels that appear to be highly depolarizing for product state inputs despite the 
fact that they are not close to the depolarizing channel in, for example, the norm of complete 
boundedness ji^j . 

Consider a single copy of M and the associated map V : \f) s h-> U(\0) R \f) s ) . This map takes 
S to a subspace of A ® B, and if U is selected according to the Haar measure, then the ima ge o f 
S is itself a random subspace, distributed according to the unitarily invariant measure. In [50], 
it was shown that if |5| is chosen appropriately, then the image is likely to contain only almost 
maximally entangled states, as measured by the entropy of entanglement. After tracing over B, 
this entropy of entanglement becomes the entropy of the output state. Thus, for S of suitable 
size, all input states get mapped to high entropy output states. We will repeat the analysis below, 
finding that the maximum allowable size of S will depend on p as described by the following two 
lemmas. 

Lemma III.l The maps f p (\<p)) = H p ((p A ) on unit vectors (states) \<p) E A <g) B, 2 < \A\ < \B\, have 
expectation 

E/ p > E/oo > In \A\ - ly /\A\/\B\, (19) 

for a uniformly random state <p, with a universal constant 7 which may be chosen arbitrarily close to 3 for 
sufficiently large \A\. 

Furthermore, for p > 1, the functions f p are all Lipschitz continuous, with the Lipschitz constant A p 
bounded above by 

A ^ JT^pJ^ 1 ' 1 - (20) 

Proof The first inequality in Eq. ((19)) is by the monotonicity of the Renyi entropies in p. For the 
second, observe fooiW)) = — In \\<j> \\oo, so 

E/oodp)) = E(- In \\v A \\J > - lnE H*/^ . 

The expectation of the largest eigenvalue of tp A has been widely studied in random matrix theory. 
Just note that \cp) is well-approximated by a Gaussian unit vector |T), that is, a random vector all 
of whose real and imaginary components (in any basis) are i.i.d. normal with expectation and 
variance 1/2|A||£?|. (See [51, Appendix].) Indeed, by the triangle inequality, 

Ej|</|| <E r ||r A || , 

V II ~ Moo — II Moo' 



and the right hand side, for large A and B, is known l|52l, 15311 to be asymptotically 
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The explicit upper bound of - \a\\b\ — nas been obtained for matrices with real Gaussian en- 
tries [54], but the analogous statement for complex Gaussian entries seems to be unknown. 

Now, for the Lipschitz bound: we proceed as in [50], inferring the general bound from a Lip- 
schitz bound for the Renyi entropy of a dephased version on ip A . Fix bases {\j}} and {\k}} of 
A and B, respectively, so that we can write \<p) = J2jk fjk\j)W)> where the coefficients are to be 
decomposed into real and imaginary parts: cpjf. = tj^ + it^\. 

We actually show that 



9p(\<p)) 



1 — p 



InTr 



EIjXjVIjXjI 



j j \ kz / 



is ^rjl^l 1 ^ 2 1 ^ 2p -Lipschitz. This implies the result for f p as follows. Note first that g p (\<p}) > 
f p (\ip}), with equality if {| j)} is an eigenbasis of (p A . Now, for two vectors \ip), we may without 
loss of generality assume that f p (\ip)) > fpi\f)), and that {|j)} is the eigenbasis of ip A . Thus, by 
assumption, 

/p(iv>» - urn < 9 P m - g P {W)) < ^a^-^u) - m 2 . 

To bound the Lipschitz constant of g p , it is sufficient to find an upper bound on its gradient. It 
is straightforward to see that 



kz 1 ~ p Ef ( E 



Of 



so introducing the notation xj = Ylkz tfkz' we have 



y ■ Zptjkz ( X)*ifcV ) 



p-1 



V E^f" 1 V E ? K) (2p - 1)/p 



PII2 



(i-p)J 



Ej x i 



2 ' 



which we need to maximize subject to the constraint Xj = 1. Since (2p — l)/p> 1, the function 
is convex. Therefore, for fixed s = ■ > the right hand side is maximal when 



y 



(2p-l)/p 



all the except for one are 0. Thus, 



| VfiTp 1 1 2 < max^ii-K^! - _ 



V J(2p-l)/p]-2 



4p2 



(1-P)' 



l^l 1 - 1 ^, 



and we are done. 



□ 



Lemma III.2 Let A and B be quantum systems with 2 < \A\ < \B\ and 1 < p < oo. Then there exists a 
subspace S C A® B of dimension 



\S\ 



(21) 



p7 1x1(5/5)' 

(with a universal constant c), that contains only states \ip) & S with high entanglement, in the sense that 

H p (<p A ) > In \A\- a -/3 + ln(l -5), (22) 
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where (3 = j^\A\/\B\ is as in Lemma 17777X1 The probability that a subspace of dimension \S\ chosen at 
random according to the unitarily invariant measure will not have this property is bounded above by 

2<^y Sl e W (^-c(l-^aM 1/p \B\ ] j. (23) 
The universal constant c may be chosen to be 1/727T 3 . 

Proof The argument is nearly identical to the proof of Theorem IV.l in |5(3l, but with an im- 
provement, possible due to the fact the we are looking at a function defined via a norm. (See |53] 
and ffl.) 

First of all, by Levy's Lemma, for a function / on pure states of A B with Lipschitz constant 
A, the random variable f(\p>}) for a uniformly distributed \tp) on the unit sphere in A (g> B obeys 

/ 2 a 2 

Pr{/ < E/ - a} < 2exp I \A\\B\ 



(See [50, Lemma III.l] for an exposition.) We apply this to f p , for which we have a Lipschitz bound 
by Lemma flII.ll Furthermore, we can find a 5-net M. of cardinality \M\ < (5/<5) 2 ' 5 ' on the unit 
vectors in S [50, Lemma III.6]. In other words, for each unit vector \ip) & S there exists a\<p) G M. 
such that || | ip) — \<p}\\2 < 5. Combining the net, the Lipschitz constant and the union bound, we 
get 

Pr{^)eM f p m<ln\A\-a/2-p} < (j^exp (-A (f \A\\B\) , 

which is the probability inequality claimed in the theorem. Moreover, the right hand side is less 
than 1 if IS* | is chosen as stated in the theorem. 

Now, assume we have a subspace S with a 5-net M. such that 

(V\<p) eM)(f p (<p)>ln\A\-a-f3), i.e. r := mzx {tp)eM \\ip A \\ p < e -(i-i/p)(ta|A|-a-0 (24 ) 
Denote 

d II A ii ii Aw 

K '■= ITI&X-^qs un i t vec t or \\tp \\p = maXp^Q supported on 5 \\P Wpi 

where the latter equality is due to the convexity of the norm. Hence, for each unit vector \<p) E S 
and corresponding \tp) € M such that \\tp — <p\\i < S, 

y A \\ P < \\& A \\p + y A - & A \\p <r + 5R, 

where we have used triangle inequality and the trace norm bound on ip — (p. Consequently, R < 
r/(l — $), and inserting that into Eq. ((24]> finishes the proof. □ 

Consider now the product channel J\f (8> N, where N(p) = Tr^ V pV T is the complex conjugate 
of M. We will exploit an approximate version of the symmetry used in the random unitary channel 
counterexamples. Fix orthonormal bases of S, A and B to be used in the definition of maximally 
entangled states involving these systems. (These have to be the same product bases with respect 
to which we define the complex conjugate.) 

In the trivial case where \S\ = \A (g> B\, the isometry V is unitary and the identity V V\Q) = 
(VV T ® = |$) for the maximally entangled state l^) 5152 implies that 

(AA®A0(|3>)($| Sl52 ) = Ti BlB2 [|$>($| AlA2 ® |$)($| BlB2 l = |$)($| AlA2 . (25) 
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The output of Af (8 will thus be a pure state. In the general case, we will choose |<S|/| A (8 B\ to 
be large but not trivial, in which case useful bounds can still be placed on the largest eigenvalue 
of the output state for an input state maximally entangled between S\ and 52- 

Lemma III.3 Let \<&) SlS2 be a state maximally entangled between S\ and S2 as in the previous paragraph. 
Then (M (8 N)(& SlS2 ) has an eigenvalue of at least Jff L . 

Proof This is an easy calculation again exploiting the U®U invariance of the maximally entangled 
state. Note that whereas V is an isometric embedding, W is a partial isometry. More precisely, it 
can be understood as a unitary W on A (8 B followed by a fixed projection P, say onto the first \S\ 
coordinates of A (8 B. Now, 

HCA/"® ^I^X^I 5152 )!^ > Tr {[(N ® N)\$){$\ SlS2 ]\§){$\ AlA2 ) 

> Tr ((V (8 V)\$)($\ SlS2 (V <8 V)\\$)(&\ AlA2 8) \<S>)($\ BlB2 )^j 

= Tr ((P (8) P)|$)($| 5l52 (P (8) P){U 8> U)^ (\§){§\ AlA2 (8 \<5>)(<$>\ BlB2 )(U (8) £/)) 

p\(\<h\/<h\ A i A 2 IrT.V.ftl-Bi-Ba^ _ 1^1 

lAllBl' 



Tr ((P®P)|$)($| SlS2 (P®P)(|$){$| ii;l2 (8 |$)($| BlB2 )) 



and we are done. □ 

In order to demonstrate violations of additivity, the first step is to bound the minimum output 
entropy from below for a single copy of the channel. Fix 1 < p < 00, let \B\ = \A\ so that (3 = 7, 
set a = S = 1/2, and then choose |S*| according to Lemma ttll.21 With probability approaching 1 as 

\A\ -» 00, 

Hf n {Af) > In |A| - 7 - 1/2 - In 2, (26) 

when the subspace S 1 defining the channel is chosen according to the unitary invariant measure. 
(Since we're interested in \A\ — ► 00, we may choose any 7 > 3.) The same obviously holds for 
H™ m (N). Recall that the entropy of the uniform distribution is ln\A\ so the minimum entropy 
is near the maximum possible. Fix a channel such that these lower bounds on H™ m (J\f) and 
H™ in (N) are satisfied. 

By Lemma MM 



l-p a )-\-p \\A\\B\) l-p V ; 

where the X a are the eigenvalues of (TV <8 A/')($). Substituting the value of \S\ from Lemma Hll.21 
into this inequality yields 

H P ((N®N)($)) <\n\A\+0{l + ^-^ln^\ < In \A\ + O ((1 - l/p)' 2 ) , (28) 

where the O notation hides only an absolute constant, independent of \A\ and p > 1. Thus, the 
Renyi entropy of (AA ® A/")($) is strictly less than H™ in (N) + H™ n {M) > 2 In |A| - 0(1). This is a 
violation of conjecture ||TT]) , with the size of the gap approaching In \A\ — 0(1) for large \A\. 

Theorem III.4 For all 1 < p < 00, there exists a quantum channel for which the inequalities d26l ) and 
(l28l > /zo/d. T/ze inequalities are inconsistent with the maximal p-norm multiplicativity conjecture. □ 
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Note, however, that changing p also requires changing \S\ according Lemma ttll.21 so we have 
a sequence of channels violating additivity of the minimal output Renyi entropy as p decreases to 
1, as opposed to a single channel doing so for every p. This prevents us from drawing conclusions 
about the von Neumann entropy by taking the limit p — ► 1. Likewise, an examination of Eq. j28)) 
reveals that we also lose control over the two-copy minimum output entropy of a fixed channel 
as p — » 1. 

Another observation comes from the fact that our examples violate additivity by so much: 
namely that, due to Lemma IIII.31 the dimension of the subspace S in Lemma IIII.2I is essentially 
optimal up to constant factors (depending on p). Any stronger violations of additivity would 
contradict Eq. (©, the inequality Hf n (Af®Af)> Hf n (Af) . 

As an aside, it is interesting to observe that violating maximal p-norm multiplicativity has 
structural consequences for the channels themselves. For example, because entanglement- 
breaking channels do not violate multiplicativity [56fl, there must be states \if;) SlS2 such that 
(Af I S2 )(il)) is entangled, despite the fact that Af will be a rather noisy channel. (The same con- 
clusions apply to the maps of section fll], where the conclusion takes the form that e-randomizing 
random unitary channels need not be entanglement-breaking.) 



IV. THE VON NEUMANN ENTROPY CASE 



Despite the large violations found for p close to 1, the class of examples presented here do not 
appear to contradict the minimum output entropy conjecture for the von Neumann entropy. The 
reason is that the upper bound demonstrated for H p ((Af <g> Af )($>)) in the previous section rested 
entirely on the existence of one large eigenvalue for (Af <8> Af)($>). The von Neumann entropy 
is not as sensitive to the value of a single eigenvalue as are the Renyi entropies for p > 1 and, 
consequently, does not appear to exhibit additivity violations. With a bit of work, it is possible to 
make these observations more rigorous. 

Lemma IV.l Let \&) SlS2 be a maximally entangled state between S\ and Si- Assuming that \ A\ < \B\ < 
\S\, 

J Tr [((Af® A7)(|d>)(d>|)) 2 ] dU = + O , (29) 

where "dU" is the normalized Haar measure onR®S = A®B. 

A description of the calculation can be found in Appendix [A] Let the eigenvalues of (Af Af)($>) 
be equal to Ai > A2 > • • • > X\a\ 2 - F° r a typical U, Lemmas IIII.3I and IIV. 1 1 together imply that 

x>?^(w)- (30) 

Thus, aside from Ai, the eigenvalues Xj must be quite small. A typical eigenvalue distribution is 
plotted in Figure [TJ If we define Xj = \j/(l — then Y^j>i A? = 1 an d 

Hi(X) > H 2 (X) = -m^Af = 2hx\A\ - 0(1). (31) 

An application of the grouping property then gives us a good lower bound on the von Neumann 
entropy: 

#i((AA®A7)($)) =H 1 (X)=h(Xi) + (l-X 1 )H 1 (X) = 2Jn\A\-0(l), (32) 
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Spectrum of the output density operator 

10° i ,— — ^- — ^- , — 




100 200 300 400 500 

Eigenvalue index 



FIG. 1: Typical eigenvalue spectrum of (AT <g> Af) ($) when \R\ = 3 and \A\ = \B\ = 24. The eigenvalues are 
plotted in increasing order from left to right. The green dashed line corresponds to |S , |/(|A||S|) = 1/3, which 
is essentially equal to the largest eigenvalue. The red solid line represents the value (1— JnL )/!^-! 2 = 1/864. 
If the density operator were maximally mixed aside from its largest eigenvalue, all but that one eigenvalue 
would fall on this line. While that is not the case here or in general, the remaining eigenvalues are nonetheless 
sufficiently small to ensure that the density operator has high von Neumann entropy. 

where h is the binary entropy function. This entropy is nearly as large as it can be and, in particu- 
lar, as large as H mm (M) + H mm (M) according to Theorem IV.l of 15011 . the von Neumann entropy 
version of Lemma [111.21 

V. DISCUSSION 

The counterexamples presented here demonstrate that the maximal p-norm multiplicativity 
conjecture and, equivalently, the minimum output p-Renyi entropy conjecture are false for all 
1 < p < oo. The primary motivation for studying this conjecture was that it is a natural strength- 
ening of the minimum (von Neumann) output entropy conjecture, which is of fundamental im- 
portance in quantum information theory. In particular, since the multiplicativity conjecture was 
formulated, most attempts to prove the minimum output entropy conjecture for special cases ac- 
tually proved maximal p-norm multiplicativity and then took the limit as p decreases to 1. This 
strategy, we now know, cannot be used to prove the conjecture in general. 

From that perspective, it would seem that the results in this paper cast doubt on the validity 
of the minimum output entropy conjecture itself. However, as we have shown, the examples 
explored here appear to be completely consistent with the conjecture, precisely because the von 
Neumann entropy is more difficult to perturb than the Renyi entropies of order p > 1. It is 
therefore still possible that the p = 1 conjecture could be demonstrated using subtle variants 
of p-norm multiplicativity such as exact or approximate multiplicativity in a channel-dependent 
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interval (1,1 + 6). 

Another strategy that is still open would be to approach the von Neumann minimum output 
entropy via Renyi entropies for p < 1. It is possible that additivity holds there even as it fails for 
p > 1. That is not, unfortunately a very well-informed speculation. With few exceptions [57], 
there has been very little research on the additivity question in the regime p < 1, even though 
many arguments can be easily adapted to this parameter region. (Eq. ((13)), for example, holds 
for all < p.) Unfortunately, since the time the examples presented here were first circulated, 
counterexamples for p close to were also discovered [58], casting doubt on the conjecture for the 
whole set of Renyi entropies with p < 1. Indeed, as in the current paper, those examples are based 
on influencing a single eigenvalue of the output state of the tensor product channel; while here 
we increase the largest one, there the smallest is suppressed. 

Thus, while it seems doubtful that the examples of channels presented here will have direct 
implications for the addivity of the minimum von Neumann entropy, we think that they are still 
very useful as a new class of test cases. Indeed, as we remarked earlier, our examples eliminate 
what had been the previously favoured route to the conjecture via the output p-norms. 

As a final comment, while this paper has demonstrated that the maximal p-norm additivity 
conjecture fails for p > 1, all the counterexamples presented here have been nonconstructive. For 
the examples based on e-randomizing maps, all the known explicit constructions (by Ambainis 
and Smith J5^] or via iterated quantum expander maps (6(iLl6lll ) only give bounds in the 2-norm, 
which do imply bounds on the output p-norm but those are too weak to yield counterexamples to 
multiplicativity. Likewise, the counterexamples based on generic quantum channels rely on the 
existence of large subspaces containing only highly entangled states. Even when the entangle- 
ment is quantified using von Neumann entropy, in which case the existence of these subspaces 
was demonstrated in 2003 [501, not a single explicit construction is known. The culprit, as in 
many other related contexts [62], is our use of the probabilistic method. Since we don't have any 
explicit counterexamples, only a proof that counterexamples exist, it remains an open problem to 
"derandomize" our argument. 
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APPENDIX A: PROOF OF LEMMA JVJ] 



We will estimate the integral, in what is perhaps not the most illuminating way, by expressing 
it in terms of the matrix entries of U. Let i7 g)0 & = R (0\ s (s\U\a) A \b) B . Expanding gives 

Tr 



((AA®Ar)(|$)($|)) z j dU (Al) 



a l. a 2 i)!,6 2 s l> s 2 
a 'l> a 2 61,63 S 'l' S 2 



Following |I63|, 16411 , the non-zero terms in the sum can be represented using a simple graphical 
notation. Make two parallel columns of four dots, then label the left-hand dots by the indices 
(si, s 2 , s^, s 2 ) and the right-hand dots by the indices v = (a 2 b 2 , a[bi, a' 2 b' 2 , aib[). Join dots with a 
solid line if the corresponding U matrix entry appears in Eq. (|A1|) . Since terms integrate to a non- 
zero value only if the vector of U indices w = (a\bi, a 2 b2, a! x b' x ,a 2 b 2 ) is a permutation of the vector 
of U indices, a non-zero integral can be represented by using a dotted line to connect left-hand 
and right-hand dots whenever the corresponding U matrix entry appears in the integral. 

Assuming for the moment that the vertex labels in the left column are all distinct and likewise 
for the right column, the integral evaluates to the Weingarten function Wg(7r), where tt is the 
permutation such that Wi = v^uy For the rough estimate required here, it is sufficient to know 
thatWg(vr) = ®({\A\\B\)-^-^) , where | 7r| is the minimal number of factors required to write tt as 
a product of transpositions, and that Wg(e) = (\A\\B\)~ 4 (l + 0(\A\- 2 \B\- 2 )) 

The dominant contribution to Eq. jAl} comes from the "stack" diagram 



si 


• 


• 


O202 


= a\b\ 




• 


• 


a[bi 


= a' 2 b 2 






-i 


• 


• 


a' 2 b' 2 


= a;6i 






s> 2 


• 


• 


aib[ 


= a 2 b' 2 , 







in which the solid and dashed lines are parallel and for which the contribution is positive and 
approximately equal to 



j^EEE S aia2 6 blb2 S a[a , 2 S m Wg(id) = |^LL_ (1 + 0(\A\- 2 \B\- 2 )) . (A2) 

a l- a 2 bifa s l' s 2 I I I I 

a 'l< a 2 b[,b' 2 s'i>4 

(The expression on the left-hand side would be exact but for the terms in which vertex labels are 
not distinct.) To obtain an estimate of Eq. ^Al} , it is then sufficient to examine the other terms and 
confirm that they are all of smaller asymptotic order than this. There are six diagrams representing 
transpositions, and their associated (negative) contributions are 

s\ • -• a 2 b 2 = a 2 b 2 

X ' I r 11 

S2 • \ 1 — • a\bi = a 2 »2 
a' 2 b 2 = a'^ 
a\b\ = a\bi, 



81 


• • 


a 2 b 2 = 


aih 


Sl 


• 


-• 


a 2 b 2 = 


aibi 




• • 


ai&i = 


a' 2 b 2 


S2 


• 


-• 


a[bi = 


a 2 b 2 




\ 




s'i 




a' 2 b 2 = 


a 2 b' 2 


s'i 


\ / 
• X — 


-• 


a' 2 b' 2 = 


a[b[ 




— -• 




/ \ 






/ \ 
^» 


aib[ = 


a'A 


s' 2 




N . 


a^i = 


a' 2 b 2 














®(\S\ 2 \A 


- 4 |i?|- 


2 ) 




@(\S\ 2 


.4 


- A \B\- 


4 ) 



• — 

\ 

\ 

• — 


• 

/ 

/ 




\ / • 




X 


• — 


J-± — • 


/ 





Gdsi 2 ^)- 2 ^!- 4 ) 
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s i • • 02^2 = aibi s\ •- -• a 2 b 2 = a[b{ s\ »^ -• a 2 b 2 = a' 2 b2 

S2 •- -• a[bi = a[b[ s 2 • — * — • a[bi = a' 2 b 2 s 2 •- -• a[bi = a\b\ 

s' x •- -• a' 2 b' 2 = a' 2 b 2 s[ •- -• a' 2 b' 2 = a\b\ s[ • • a' 2 b 2 = a'^ 

s 2 • • a\b' x = a 2 b 2 s' 2 • • aib[ = a 2 b' 2 s 2 • • a\b' x = a 2 b 2 

e{\s\ 2 \A\~ 2 \B\- 4 ) Gdsi 2 ^!- 4 !^!- 4 ) o{\s\ 2 \a\-*\b\- 2 ). 



For permutations it such that \n\ > 1, the Weingarten function is significantly suppressed: 
Wg(7r) = C(|A| _6 |i?| _6 ). Moreover, for a given diagram type, the requirement that Wi = 
can only hold if at least two pairs of the indices ai, a 2 , a[, a 2 , b\,b 2 , b[,b 2 are identical. The contri- 
bution from such diagrams is therefore C( | S"! 2 1 A| -4 |l?|~ 2 ). 

To finish the proof, it is necessary to consider integrals in which the vertex labels on the left- or 
the right-hand side of a diagram are not all distinct. In this more general case, choosing a set C of 
representatives for the conjugacy classes of the permutation group on four elements, the value of 
the integral can be written 

£iV(c)Wg(c), (A3) 



where 



v=a(v) w=t(w) 

These formulas have a simple interpretation. Symmetry in the vertex labels introduces ambigu- 
ities in the diagrammatic notation; the formula states that every one of the diagrams consistent 
with a given vertex label set must be counted, and with a defined dimension-independent multi- 
plicity. Conveniently, our crude estimates have already done exactly that, ignoring the multiplici- 
ties. The only case for which we need to know the multiplicities, moreover, is for contributions to 
the dominant term, which we want to know exactly and not just up to a constant multiple. 

We claim that in the sum (|A1[) there are at most C(|S'| 4 |A||i3| 3 ) terms with vertex label symme- 
try. The total contribution for terms with vertex label symmetries r and a in which \tit(t\ > 1 is 
therefore of size C ( | ^ | 2 1 ^4 1 4 1 1 2 ) and does not affect the dominant term. To see why the claim 
holds, fix a diagram type and recall that the requirement Wi = v^u) for a permutation it can only 
hold if at least two pairs of the indices a\,a 2 , a[ ,a 2 ,b\,b 2 , b[ , b 2 are identical. Equality is achieved 
only when all the A indices or all the B indices are aligned, corresponding to the following two 
diagrams: 

s\ •- -• a 2 b 2 = a' 2 b 2 si • f a 2 b 2 = a 2 b 2 

, \ \ / 

S2 •- -• a[bx = ai&i $2 » ^ \ / a[bi = a x b x 



3i •- -• a 2 b 2 = a2&2 s i * ^ 1 N ^ * a 2^2 = a 2^2 

\ / \ 

s f 2 •- -• a\b x = a x b x s 2 • • aib[ = a±b\ 



•- 






-• 


\ 


\ 


/ 


/ 


•- 






-• 




. \ 


/ . 

< 




•- 


'—L 


\ • 


-• 




/ 


\ 




.- 









For the first diagram, using the fact that \A\ < \B\ < \S\, it is easy to check that imposing the 
extra constraint that either the top or bottom two S or AB vertex labels match singles at most 
0(|S'| 4 |j4||i?| 3 ) terms from Eq. (|AH> . Similar reasoning applies to the second diagram, but impos- 
ing the constraint instead on rows one and four, or two and three. For all other diagram types, at 
least four pairs of the indices a\,a 2 , a[ , a 2 ,b\,b 2 , b' x , b 2 are identical. (The number of matching A 
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and B indices is necessarily even.) In a term for which the vertex labels are not all distinct, either a 
pair of S indices or a further pair of A or B indices must be identical. In the latter case, there must 
exist an identical A pair and an identical B pair among all the pairs. Again using \ A\ < \B\ < \S\, 
there can be at most O ( | 4 1 J? | 3 ) such terms per diagram type, which demonstrates the claim. 

We are thus left to consider integrals with vertex label symmetry and N(e) ^ in Eq. (|A3|) . If 
N(e) = 1, then our counting was correct and there is no problem. It is therefore sufficient to bound 
the number of integrals in which N(e) > 1. This can occur only in terms with at least 2 vertex label 
symmetries. Running the argument of the previous paragraph again, for the two diagrams with 
A or B indices all aligned, this occurs in at most 0(|S| 4 |-B| 2 ) terms. For the rest of the cases, it is 
necessary to impose equality on yet another pair of indices, leading again to at most 0(|S| 4 |-E>| 2 ) 
terms. Since Wg(e) = Od^l -4 ]!?! -4 ), these contributions are collectively C(|S'| 2 |^4|~ 4 |i?|~ 2 ). 

The bound on the error term in Eq. ((29|) arises by substituting the inequalities \S\ < \A\ \B\ and 
\A\ < \B\ into each of the estimates calculated above. 
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